Documentation Index
Fetch the complete documentation index at: https://superdoc-dependabot-npm_and_yarn-npm_and_yarn-e04d5d616f.mintlify.app/llms.txt
Use this file to discover all available pages before exploring further.
Supported Input Formats
Currently, SuperDoc API supports DOCX (Microsoft Word) files as input:DOCX (Microsoft Word)
.docx - Office Open XML format - Full formatting support - Maximum file
size: 25MB (authenticated) / 5MB (public) - Supports all modern Word featuresDOCX Compatibility
SuperDoc is compatible with DOCX files created by:- Microsoft Word (2007 and later)
- Google Docs (exported as DOCX)
- LibreOffice Writer (saved as DOCX)
- Apple Pages (exported as DOCX)
- Other Office suites that support OOXML
Looking for support for other input formats? Contact
us to discuss your needs.
Supported Output Formats
SuperDoc converts documents to PDF format:PDF (Portable Document Format)
.pdf - Adobe PDF format - High-fidelity conversion - Preserves
formatting and layout - Universal compatibility - Optimized for web and printPDF Features
Our PDF output includes:Text Preservation
- All text remains selectable - Fonts embedded or substituted - Character encoding preserved
Layout Fidelity
- Page dimensions maintained - Margins and spacing preserved - Column layouts intact
Interactive Elements
- Clickable hyperlinks - Bookmarks for navigation - Form fields (when applicable)
Metadata
- Document properties - Creation timestamps - Author information
File Size Considerations
Input File Limits
| Endpoint Type | Maximum Size | Recommended Size |
|---|---|---|
| Authenticated | 25MB | < 10MB |
| Public | 5MB | < 2MB |
Output File Optimization
SuperDoc automatically optimizes PDF output:- Image compression - Reduces file size while maintaining quality
- Font subsetting - Includes only used characters
- Metadata cleanup - Removes unnecessary data
- Structure optimization - Efficient PDF organization
Format-Specific Features
DOCX Input Features
Text and Paragraphs
Text and Paragraphs
- Rich text formatting (bold, italic, underline, etc.)
- Multiple font families and sizes
- Text colors and highlighting
- Paragraph alignment and spacing
- Bullet points and numbered lists
- Indentation and hanging indents
Document Structure
Document Structure
- Multiple sections with different formatting - Headers and footers (including different first page) - Page numbers and fields - Table of contents - Footnotes and endnotes - Page and section breaks
Tables and Objects
Tables and Objects
- Complex table structures - Cell formatting and borders - Table styles and themes - Merged cells and nested tables - Images and embedded objects - Charts and SmartArt graphics
Advanced Elements
Advanced Elements
- Hyperlinks (internal and external)
- Cross-references
- Comments and track changes*
- Form fields and controls
- Text boxes and shapes
- Watermarks and backgrounds
PDF Output Characteristics
PDF Version
PDF Version
- PDF 1.4 compatibility - Ensures maximum compatibility
- PDF/A compliance available on request
- Searchable text - All text remains selectable
- Vector graphics - Scalable elements preserved
Security Features
Security Features
- No password protection by default - Print permissions enabled - Copy permissions enabled - Modification restrictions available on request
Accessibility
Accessibility
- Tagged PDF structure
- Alternative text for images (when present in source)
- Reading order preservation
- Logical document structure
Quality Comparison
Before and After Conversion
| Element | DOCX Source | PDF Output | Fidelity |
|---|---|---|---|
| Text formatting | ✅ Rich formatting | ✅ Preserved | 99.9% |
| Page layout | ✅ Complex layouts | ✅ Maintained | 98% |
| Images | ✅ High resolution | ✅ Optimized | 100% |
| Tables | ✅ Complex tables | ✅ Preserved | 97% |
| Hyperlinks | ✅ Active links | ✅ Clickable | 100% |
| Comments | ✅ Reviewers’ notes | ⚠️ Flattened | N/A |
Future Format Support
We’re actively working on expanding format support:Coming Soon
- RTF (Rich Text Format) - ODT (OpenDocument Text) - HTML to PDF conversion
Under Consideration
- PowerPoint to PDF - Excel to PDF - Markdown to PDF
Have a specific format requirement? Let us know and
we’ll consider adding it to our roadmap.
Best Practices by Format
Optimizing DOCX for Conversion
- Use standard fonts - Arial, Times New Roman, Calibri work best
- Embed custom fonts - Ensure fonts are included in the DOCX file
- Optimize images - Use appropriate resolution (300 DPI for print, 150 DPI for web)
- Simplify layouts - Avoid overly complex table structures
- Test compatibility - Preview in different Word versions
PDF Output Optimization
- Consider end use - Web viewing vs. print requirements
- Monitor file size - Balance quality with download speed
- Verify links - Test hyperlinks in the output PDF
- Check accessibility - Ensure proper reading order
- Validate structure - Confirm all elements converted correctly
Technical Specifications
DOCX Processing
- Engine: LibreOffice-based conversion
- Memory limit: 1GB per conversion
- Timeout: 60 seconds maximum
- Concurrent processing: Load-balanced across multiple servers
PDF Generation
- Renderer: Gotenberg PDF engine
- Compression: Automatic optimization
- Color space: RGB (can convert to CMYK on request)
- Resolution: Maintains source image DPI up to 300 DPI

